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ALMOST SURE CONVERGENCE IN QUANTUM SPIN GLASSES 


DAVID BUZINSKI AND ELIZABETH MECKES 


Abstract. Recently, Keating, Linden, and Wells [7] showed that the density of states 
measure of a nearest-neighbor quantum spin glass model is approximately Gaussian when 
the number of particles is large. The density of states measure is the ensemble average of 
the empirical spectral measure of a random matrix; in this paper, we use concentration 
of measure and entropy techniques together with the result of [7] to show that in fact, 
the empirical spectral measure of such a random matrix is almost surely approximately 
Gaussian itself, with no ensemble averaging. We also extend this result to a spherical 
quantum spin glass model and to the more general coupling geometries investigated by 
Erdos and Schroder. 


1. Introduction and statements of results 


In the recent paper [7], Keating, Linden and Wells show that the density of states mea¬ 
sure of a quantum spin glass with nearest neighbor interactions and Gaussian coupling 
coefficients is approximately Gaussian, as the number of particles tends to infinity. More 
specifically, they considered the following random matrix model for the Hamiltonian of a 

quantum spin glass: let {Z a ,6j}i<a,6<3 be independent standard Gaussian random variables, 

l<j’<n 

and define the 2 n x 2 n random matrix H n by 

n 3 

E £ 

j =1 a,b= 1 


(1) 


H n : = 


y/9n 


where for 1 < a < 3, 

with I 2 denoting the 2x2 identity matrix, denoting the 2x2 non-trivial Pauli matrices 


: = 

'0 

1 ' 

a® : = 

'0 -i 

a® : = 

'1 0 ' 


1 

0 


i 0 


0 -1 


and the labeling cyclic so that cr^j := a[ b) . 

The density of states measure )OS for the system is the ensemble average of the spectral 
measure of H n \ that is, if {\j}i<j<2 n are the (necessarily real) eigenvalues of H n , then for 
ACM, 

rf° S (V = ^E|{i:A j £^}|. 

In other words, fi^ os (A) is the expected proportion of the eigenvalues of H n lying in the 
set A. The main result of [7] is that ^ os converges weakly to Gaussian, as n —» 00 . The 
authors go on to consider more general collections of (still independent) coupling coefficients, 
and more general coupling geometries than that of nearest-neighbor interactions. In more 
recent work, Erdos and Schroder [3] have considered still more general coupling geometries, 
and found a sharp transition in the limiting behavior of the density of states measure 
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depending on the size of the maximum degree of the underlying graph, relative to its 
number of edges. 

The purpose of this paper is to move from convergence in expectation of the spectral 
measure of H n to the considerably stronger notion of almost sure convergece. As observed in 
[7] , the extent to which ensemble averages actually manage to describe features of individual 
systems is not always clear, but is a crucial issue if one is to make meaningful use of random 
matrix models. The following result shows that the Gaussian behavior exhibited by the 
average spectral measure is indeed the typical behavior of the empirical spectral measures. 


Theorem 1. Let fi n be the spectral measure of H n and let 7 denote the standard Gaussian 
distribution. There are universal constants C, C' and c such that 


(a) 

(b) 


C 


Ed BL (p. n ,-/) < — 


dBLifln, 7 ) > 


n 


C 

ITe 


+ t 


< Ce 


—cnt 2 . 


and 


(c) with probability 1, for all sufficiently large n, 


dBLihn, 7 )< 


a 

1/6 ■ 


n 


Here dsL^hi 11 ) denotes the bounded-Lipschitz distance between probability measures, which 
metrizes the topology of weak convergence. 

In the paper [6], Keating, Linden and Wells took a different approach to understanding 
the behavior of the spectral measures of individual Hamiltonians; rather than consider a 
random matrix model, they took the coefficients in (pQ) to be deterministic, subject to a 
normalization and boundedness condition, and showed that in that case, the non-random 
spectral measures converged weakly to Gaussian. It should be possible to take that result 
as a starting point in order to obtain almost sure convergence in the Gaussian model, 
although there are various technical challenges. We instead take a rather different approach, 
combining convergence in expectation with various probabilistic techniques. 

The organization of this paper is as follows. In Section El the pointwise estimate of 
f7] on the difference between characteristic functions of p.^ os and 7 is parlayed into an 
estimate on dBL{hn° S 1 7) using Fourier analysis. In Section [3] the Gaussian concentration 
of measure phenomenon is used to show that the random variable dsL^n, 7 ) is strongly 
concentrated at its mean. Then, the expected distance between fi n and its average Pn° s 
is estimated; this is done using a combination of applications of Gaussian concentration of 
measure, entropy methods, and approximation theory, via a similar approach to the one 
taken by the second author and M. Meckes in [9]. The almost-sure convergence rate given 
immediate consequence of part |(b)| and the first Borel-Cantelli lemma, and 


in part (c) 


is an 


is therefore not discussed further. In Section 01 we consider a modification of the random 
matrix model above, in which the coefficients in H n are not independent but are drawn 
uniformly from the 9n-dimensional sphere, and show that the empirical spectral measure is 
almost surely approximately Gaussian in that setting as well. Finally, Section [5] offers some 
remarks on extensions of Theorem [I] to further related ensembles. 
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Notation and Conventions. Let the random matrix H n be defined as above, with eigen¬ 
values Ai < • • • < A2™ • The empirical spectral measure \i n of H n is defined by 

2 n 

Mn := 2~ n J2d Xj -, 

3 =1 


that is, n n is the random probability measure putting equal mass at each eigenvalue of H n . 
Its ensemble average E/r n is denoted Pn° s and is called the density of states measure. 

For probability measures /j, and v on M, the bounded-Lipschitz distance cZblOl 1 ') from 
H to v is defined by 


dBL{n,v) := sup 




fdv 


BL — 


< 1, 


where ||/||bl denotes the bounded-Lipschitz norm of /, defined by 


\\f\\BL ■= ll/Hoo + \f\L, 


with \f\i denoting the Lipschitz constant of /. The bounded-Lipschitz distance metrizes 
weak convergence of probability measures. 

If the test functions are required only to be Lipschitz and not necessarily bounded, one 
gets instead the Li-Kantorovich distance Wi(/i,v): 


Wi(n, v) \= sup 


fdii— / fdv 


: I/U<1 • 


Clearly, < Wi{n,v). 

It is the Kantorovich-Rubenstein theorem that W± is also given by 


W\(n, v) = inf 

7T 


J \x - y\dn(x,y), 


where the infimum is taken over all couplings n of the meausures /r and v. It is for this 
reason that W\ is also called the Lj-coupling distance. 

Finally, symbols such as C , c will denote universal constants independent of all parame¬ 
ters, which may change in value from one appearance to the next. 


2. Gaussian density of states in the Kantorovich distance 

The crucial ingredient in the estimation of d,BL{lin° S , 7) is the following pointwise bound 
from | 7 ] on the difference between the corresponding characteristic functions. 

Theorem 2 (Keating, Linden, and Wells). Let iin° s and 7 be as above; denote the char¬ 
acteristic function of iin° S by tjj n and the characteristic function of 7 by ip. There is a 
constant C independent of n such that for all £, 

(2) I MO - *>(01 < %■ 

y/n 

The main result of this section is the following. 

Theorem 3. Let ii^ os and 7 be as above. There is a constant c such that 

d B L(hn OS 1 7 ) < 








4 


DAVID BUZINSKI AND ELIZABETH MECKES 


Note that by defintion of the bounded-Lipschitz distance there is no loss in restricting to 
the case /(0) = 0, which we do for the remainder of the proof. 

The first step is to make a truncation argument to further restrict the class of test 
functions considered. Given a function / : R —>• R with ||/||_bl < 1 and /(0) = 0, and given 
R > 0, define the trunction f R by 


( 3 ) Ir(x) 


f(R) + [sgn(f(R))](R - x ), 
f{-R) + [sgn(f(—R))\(x + R), 
0, 


\x\ < R; 

R<x <R + \f{R)\; 

-\f(-R)\-R<x<-R] 

x <-\f(-R)\-R or x>R + \f(R)\. 


Then ||/i?||_BL < 1 and f R is supported on [—2R,2R]. 


Lemma 4. 


(a) 

(b) 


For any t > 0, 

Vn OS ({* : 1*1 > t}) < 
For f : M -A- M with ||/||bl < 1, 


c 


j{f-f R )d^° S 



Proof. A straightforward Fubini’s theorem argument (see, e.g., Section 26 of ®) gives that 

Fn° S ({® : \x\ > t}) < - J a (1 - ifn(0)d£, 

where as before V’n(?) denotes the characteristic function of fr ^ os . Adding and subtracting 
the characteristic function y?(£) of the standard Gaussian distribution and using ([2|) then 
gives 


2 

t f t 


2 2 , 

(1 - 1pn(0)df, = ^ J 2 (! - (p(£))d£ + \ j 2 (pit) - ^n{i))df + 


For part |(b)[ note that by construction, | f(x) — /r(x) \ < 1; moreover, f(x) = f R (x) for 
\x\ < R, so that 


fdf£ OS - j f R d^ OS 


< 


'|ai|>-R 


d^° S . 


Part (a) 


is now 


immediate from part |(b)| 


□ 


Now let / : M —>• M have ||/||_bl < 1 and supp(f) C [—2 R, 2R\. The next step in the proof 
of Theorem [3] is to approximate / by 


where K\ is the Fejer kernel 


1 


2vr 


—A 


fx-=f* K x , 




A 


A f sin (Ax/2) 


27t \ Ax/2 


For / as above, one can approximate in the supremum norm, as follows. 
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Lemma 5. Let f : 


have \\f\\BL < 1 and supp(f) C [— 2R, 2R\. Then 
8 log(A) + 8 log(2i?) + 6 


I/(*) - fx(x)\ < 


7T A 


Proof. By definition of f\ (using the second form of the Fejer kernel) and the fact that 

JZc K ^y) d y = L 


I f(x) ~ h(x)\ = 


1 


7r 




2 y 
A 


sin(y) 


V 


dy 


Now, if x € [—2R,2R], then using the fact that / is supported on [—2R,2R\ and is 1- 
Lipschitz yields 


7r l/(») - /aO)I < 1/0*01 


2 R) 

sin (y) 

2 /*—1 

dy + 

2 sin 2 (y) 

dy + f 

2 sin 2 (y) 


y 

J±{x-2R) 

A y 

J- 1 

A y 


+ 


/ 


±(2R-x) 


x) 

2sin 2 (y) 

roc 

dy + \f(x)\ / 

sin (y) 


A y 

J$(2R-x) 

y 


dy 


= :/ + // + III + IV + V. 


Since f(2R) = 0, 


I < 




{X ~ 2R) L dy = 2 |/(g)-/( 2 ^)| < 2 


y 


V is handled the same way. Next, since |x| < 2 R, 


II < 


r—1 


' ±(x-2R) 


A|y| 


A 


A 


dy = - log r( 2R ~ x ) ^ 


A(2 R-x) “A’ 

4(log(A) + log(2i?)) 


A 


IV is the same. Finally, using the bound 

III< , 


sin (y) 

y 


< 1 gives 


04 


If x > 2R, then by the concavity of the logarithm, 


4 f{x) - fx(x)\ < 


K *+2 R) 


${x-2 R) 


2sin 2 (y) 


Ay 


dy 


2 

< - 

~ A 


A 


log -(x + 2 R) - log ( -(x — 2 R) 


A 


< - 


2 / 4i?\ 4 


< -. 


A \ x J A 


ix < cR 2 ( -^= + i ) + 


c'log(A) 


A 


dy 


The case x < —2R is the same. 

The following technical lemma is needed in order to compare f f\d d n° S to / fxdj. 

Lemma 6. Let g : R —>• M be such that |y(£)| < min |||y, j • Then 

f R 

/ \9x{x)\da 

J-R 


□ 
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Proof. Recall that g\(x) = g * K\(x), so that 


(4) 


J r \gx(x)\dx = ^ J 


r oo 


R 

—R J —oo 
R roo 


9\ x -~ 


2w\ ( sin(u;) 


w 


dw 


dx 


< — I I mm 

0 J—oo 



C\x- ^f|l /sin(rc) 


| x — —1 \fn 
Assume that A will be chosen with A > 2. For x > 0, 


w 


dwdx. 


f mm { 


C\x — 


2 w I 

A I 


"sin(u;) 


\x — 


2 w | > 


< 


l 


A 

1 C\x- 


y/n 


w 


dw 


2 w 
A 


n 


dw + 


/ 


!(*+!) C\x- 


2 w I 
A I 


w 2 \/n 


dw + 


'Ac™ , i\ w 2 \x — 


f(a:+l) 


2w 

A 


rdw. 


Now, the first term is trivially bounded by For the second term, 

*(i +x ) c \x- 2 «i| 


l 


w 2 \/n 


-dw 


\fn 

For the final term, 


C f^ 1+x ) x + ^ , C 

< —— / - dw = 

J i '“' z 


w* 


x 1 — 


2w 


rdw = 


(l+l) w \x ^ 


\x 2 


r°° i 

./i+i - 


and so 

r-R /-oo 


0 J 0 




nun 


< 


< ell 1 


C\x- 


2 w I 
A I 


'sin(u;) 


I r 2w\ ’ 
r A I 


A(1 + x) J A 


-l)‘"=A^( log(1 + 


dwdx 


+ y log ( ^(1 + x) 


x - 


1 + X 


n 


w 



C_ 

n 


2x+1 + l log ^ { 1 +x) )-\crh) 


1 1 
„ + j 1 + 


2 
A 

clog(A) 


+ ^ (k>g(l + x) - 


1 + X 


A 


Similarly, for x < 0, 

2 C\x- ^f|\ /sin(n;) N 2 


r° / 

/ mm < ■ 

J —oo ^ 


x — 


2w I ‘ 
A I 


n 


re 


dw 


1 C(x + ^) 


Av+x) c (x +1?) , 

dw + / — v 0 ^_ A y dw + 


n 


i: 


w 2 Wn 


' 4(1+3=) 


(*+¥) 


die 


in 1 


dx 


< C(x + 1) + C 


a/™ a/™ 


x 1 — 


A(1 + x) y A 


A 


+ T lo g tA 1 + x ) 


+ 


Ax 2 


x + 1 


- log 


/2x + 1\ 

U+i ) 
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and so 



sin(w;) 

w 


2 

dwdx 


< 


l 


R (SL 

L 


2x + 1 + - log 
A 


'-(l + x) 


< cR 2 


1 


1 

y/n A 


+ 


d log(A) 
A 


2x 

A(1 + x) 



x 

x + 1 


log 


f2x + l V 

U+iy. 


□ 


Proposition 7. Let f : 


have H/Hrl < 1 and supp(f) C [—2R,2R\. Then 


f\dhn° S ~ I h d d 


< cR 2 


1 1\ d log (A) 
n X A ' 


Proof. Let ip n (£) denote the characteristic function of Pn° s and the characteristic 
function of the standard Gaussian distribution 7 . Then by the various definitions, 


f\dp 


DOS 


1 

2n 

1 

2n 

1 

2n 

1 

2n 

1 

2tt 

1 

2tt 

r-2 R 

J-2R 


- j fxd'Y 

roo r X 




ixS, 


’ — 00 J —X 

f X 1(0 (l-y 

rX / roo 

L{L f(i)e 

r X / roo f! 


/ co rX / 

j_ x my x H 


lei 

e ix ^d^ OS {x) - I e ia *d 7 (®) ) di 

-OO 

>(-0 - ¥>(“£) W 


1 - y ) dCd7(*) 




/'W e - 

—A \J—o o 


- <?(-£)) d£ 

ix *dx) (i-y) (^n(-e)-^(-o)rfe 


r2R 

/ /'(*) 

J-2R 


rX 


^n{f) ~ y(Q 
£ 


1 — 11 e ^i di d x 


where the third to last line follows by integration by parts and we have used that / is 
supported on [—2 R, 2R\ in the last two lines. 

We can now apply the result of Lemma H and the fact the ||/||bl < 1 to obtain the 
conclusion. 

□ 


We are now ready to give the proof of Theorem [3] 
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Proof of Theorem 0 Let / : 
stronger counterpart for 7 ), 


sup 

l/U<i 


I fd^ os - f fd 7 


have 


-l¥ + 


BL 


< 1. Then by Lemma 0 (and its much 


sup 

I/U<i, 

supp(/)C[— 2R,2R] 


s 


fd^ os - / fd 1 


By Lemma 0 for / with ||/||bl < 1 and support in [— 2R, 2R], 

8 log(A) + 8 log( 2 i?) + 6 


and so 


sup 

\f\L<l. 

supp(f)C[-2R,2R] 


sup | f(x) - fx(x)\ < 


fd^° S ~ / fd 1 


7T A 


< 


sup 

l/U<L 

s u pp(f)C[—2R,2R\ 


/ 


f\dhn OS - I f\dl 


+ 


161og(A) + 161og(2i2) + 12 
7tA 


Applying Proposition |T| now gives 

d BL (h DOS 


c 161og(A) + 161og(2i?) + 12 

n R2 + ttA 


nZ . 1 1\ c'log(A) 

+ cR 1 n + a + — r 1 - 


Choosing A = n and R = n 1//§ completes the proof. 


□ 


3. Concentration and average distance to average 

A crucial underpinning of the remainder of the proof is the following concentration of 
measure property of a Gaussian random vector. 

Proposition 8 (See, e.g., Ch. 1 of [ 8 ]). Let (Z^)\<k<n be a standard n-dimensional Gauss¬ 
ian random vector, and let F : M n -A M be Lipschitz with Lipschitz constant L. There are 
universal constants C , c such that 

P[|F(Z l5 ..., Z n ) - EF(Z U ..., Z n )\ > t\ < Ce~ ct2 / L2 . 

The concentration phenomenon is key both in proving the concentration of the bounded- 
Lipschitz distance from fj, n to a fixed reference measure, and in estimating Ecbl(/ 7 i, pi(( os ). 
The following lemma gives the necessary Lipschitz estimates for this approach. 


Lemma 9. Let x = {x a ^j} € 


(with, say, lexicographic ordering). Define H n (x) by 
3 n 


H n (x) := 


:= 37a £ 

v a,b= 1 1=1 


(«)„(&) 


and let p n be the spectral measure of H n (x). Let f : 
(a) the map 

XI J fdn n 

is t^L= -Lipschitz, and 


a j+ 1’ 


have 


bl < 1. Then 
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(b) for any probability measure p on M, the map 

x i-A d BL (p n ,p) 

is -Aj=-Lipschitz. 

Proof. First consider the map x i —> H n (x), and equip the space of 2 n x 2 n symmetric matrices 
with the Hilbert-Schmidt norm: 

\\A\\ h .s. ■= Ti(AA t ) = Tr(A 2 ). 

For x, x 7 € M 9n , write H n := H n (x.) and H' n := H n (pd). Then 

\\H n -H'J 2 H.S=^[{H n -H' n ?\ 

n 3 

= X X ( Xa ’ b ’i ~ X 'a,b,j)( x cAk - x c,d,k) 

j,k =1 a,b,c,d= 1 

Recall that Tr(^4®R) = Tr(yl) Tr(R), and that Tr(a^ a ^) = 0 for each of the Pauli matrices. 
If j / k, then the matrix is a tensor product, at least two of whose factors 

are Pauli matrces; that is, if j k, then 

MJb) Jc)Jd) 


= o- 


If j = k, then 




and thus 


It follows that 




2 n , a = c,b = d; 
0, otherwise. 


I H n - H' u \\h.S. = 


\ 


n 3 


on “ 9 n / 2 

X X - <,6j) 2 = ^ll x - X 'H’ 


9 n 


j=l a, 6=1 


and so the map x i —y H n is |^=-Lipschitz. 


Now consider the map H n i-a J fdp n . By definition, j fdp n = /(Aj) so 




2 n 

2 n 

J fdpn- J fdp' n 

II 

to 

1 

3 

X/(Ai)-/(A$) 

3=1 

<2 -”X|A,-A'|<2-"/ 2 

3=1 \ 




making use of the fact that / is 1-Lipschitz. 

The Hoffman-Wielandt inequality (see, e.g., [TJ Theorem VI.4.1]) gives that 


2 ~n /2 


\ X I A? ~ ^j\ 2 — 2~ n//2 ||^n — H'uWh.S.- 

\ 3=1 


and so the map H n i-a f fdp, n is 2 n / 2 -Lipschitz; this completes the proof of part (a) 
For part |(b)[ first note that by the triangle inequality for d B L, 

\d BL {p n ,P ) -c BL (p' n ,p )| < c BL {p, n ,p! n ). 
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Define a coupling 7r of p n and p! n by 

2 n 

71 — 2 n ^ 

J=l 

where Aj and A) are ordered eigenvalues of H n and H' n respectively. Then by the Kantorovich- 
Rubenstein theorem, 

dBL{Hn, Hn) < W i (//„,//„)< J\x - y\dir(x, y) = ^ ^ |Aj — A'|. 

Applying the Cauchy-Schwarz inequality and the Hoffman-Wielandt inequality exactly as 
before gives that the map H n d B i(p n , p) is 2 _n / 2 -Lipschitz; together with the Lipschitz 
estimate for x h > H n (x) given above, this completes the proof. □ 


It is thus immediate from Proposition[8]and Lemma[9]that if p is any probability measure, 
(5) P[| d BL (p n ,p) -m BL (Hn,p)\ >t]< Ce~ cnt2 , 


and so part |(b)| of Theorem Q] follows immediately from part (c 


To prove part (a), recall that in the previous section it was shown that 


dBL{p 


DOS 


C 


n ’^-n 1 / 4 ' 


It thus suffices by the triangle inequality for cLbl to show that 


( 6 ) 


^d B L(Pn,P 


DOS\ 
n ) 


= sup 

II/IIbl<i 


fdpn - / .f dp. 


DOS 

n 


< 


n 


C 

TJE- 


Observe first that / with ||/||_bl < 1, if 

X f := J fdp n - J fdp ° OS , 

then E X / = 0 and by Proposition [5] and Lemma [UJ 

¥[\X f \ >t]< Ce~ cnt2 . 

More generally, 

_ cni~ 

(7) F[\X f -X g \>t}=¥[\X f _ g \>t]<Ce 

that is, the process {Xf} indexed by functions / : R —>• R with ||/||_bl E 1 and /(0) =0 is 
a sub-Gaussian stochastic process, with respect to the distance d(f,g ) = . 

The idea at this point is to use Dudley’s entropy bound to estimate the expected supre- 
mum of this process, but to do this successfully, a series of approximations must be made 
first to reduce the size of the (currently infinite-dimensional) indexing set of the process. 
The first step is a somewhat more sophisticated truncation argument than the one which 
appeared in Section [2l which allows us to assume that our test functions are finitely sup¬ 
ported. 
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Let 


op 


denote the ^2 —> ^2 operator norm of a matrix A, and observe that the map 


x 1 — y ||LT n ||pp is 1-Lipschitz: 


I Hn 11 op || H n 11 op | E || H n H n 11 0 p 

n 3 


- 3^11 x '° 

j =1 a, 6=1 
1 n 3 

v j=l a,6=l 


>)E&) II 

a,b,j)°j °j+ll|op 


J ~ x 'a,b,M a j a)a j+l\\op- 


Now, ||A(g) B\\op = ||-A|| 0 p||-S||op and all of the Pauli matrices have operator norm 1, so 
||<7^ a)< Tj+i Hop = 1 for all a,b,j. An application of the Cauchy-Schwarz inequality thus gives 
that 


I Hn 11 op || H n 11 op | E 


A I] I] \ X aA 

\ 3 = 1 a,6=l 


. _ ™/ 12 
J X a,b,j\ 


= X — X 


It thus follows from concentration of measure that 

F[\\\H n \\ op -E\\H n \\ op \>t}<Ce- ct2 


Since 


^\\H n \\ op < 



3 n 

yi ^eizq. 6 , 

a,b= 1 j=l 


one has in particular that if t > Cy/n, then 


n\\H n \\ op > t] < Ce~ ct2 . 



One can interpret this statement as saying that if R ~ y/n it is extremely unlikely that 
H n will have any eigenvalues outside [— R, ii], and so truncation of test functions to that 
interval should not result in much loss. 

More specifically, if fn is the truncation of / to [—2 R, 2 R\ given in Equation Q of Section 
El then 


E 


J (/ - fR)d/JLn 


< C^n OS ({* : |*| > R}) ■ 


Since 

Vn 03 ({* : 1*1 > R}) = | {j : |Aj| > I?}| < P [| \H n \\ op > R ], 


it follows that 


E 


(/ ~ fR)d[ir 


< Ce 


—cnR 2 


The indexing space of the process {Xf} may thus be safely reduced to those / supported 
on \—2R, 2 R\ with R of order y/n, with an error which is exponentially small in n. 


The next step in reducing the indexing space is to approximate bounded Lipschitz test 
functions by piecewise linear ones. Given / : M —>• M with ||/||_bi, < 1 and supp(f) C 
[~2R,2R], consider the piecewise linear function g : M —>• R defined so that supp(g) C 
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[—2 R, 2 R\, and g(x) = f(x) at each x of the form —2 R + for 0 < k < m. Because / is 

1-Lipschitz, 

11/ - slice < 2R 


and so 

It follows that 

( 8 ) ^jdjgL^hni h 


\Xf-X g \< 


m 


4 R 


m 


DOS\ 
n ) 


4 R 

< E sup X„ H-h Ce 

g&G m 


-cR 2 


where the supremum is taken over the class Q of functions g : 

• ff(0) = 0; 


satisfying 


m\BL < 1; 

supp(g) C [-2R, 2R\; 
g is linear on intervals of the form 


-2R + ^,-2R+^±H 


With the reduction to Q as the indexing space of our stochastic process, it is now possible 
to apply Dudley’s entropy bound (see, e.g., the introduction of mi): 

Proposition 10 (Dudley). Let {Y x : x € M} be a centered subgaussian stochastic process 
indexed by the metric space (M, d). Then 


E sup \Y X \< K / i/log N (M, d, e)de, 

i£M Jo 

where N(M, d, e) denotes the number of e-balls (with respect to the metric d) needed to cover 
M, and K > 0 depends only on the constants of the sub-Gaussian increment condition. 

Let Q denote the index set described above. Applying Proposition HOl to {X g } g£ g gives 
that 


E sup X g <K 
g&Q Jo 


/log 


N[G,'-J±,e 




de = —= Vlog [N (£/, | • \l, e)]de. 
v n Jo 


Since (G, \ ■ \l) is just an (m + l)-dimensional normed space, standard volumetric estimates 
(see ina Lemma 2.6]) give that 


N(G,\-\ L ,e)< (- 


m +1 


-cR 2 


so that together with (|H|), we have that 

Choosing R of order y/n and m = n 2 / 3 completes the proof of and thus the proof of 
part (a) of Theorem [I] 


fm 2 R 

L K \ /-1-1- Ce 

V n m 
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4. The spherical model 


As discussed in the introduction, all of the random matrix models of quantum spin chains 
considered so far involve independent coefficients. A perhaps more geometrically natural 
alternative is to consider the random matrix 

n 3 

(9) H n'-=J2Y1 Xa ^j (T f ) °j+l> 

j =1 a, 6=1 

where the a -'' 1 are as before, but the vector of coefficients x = {:Ea,ft,j}i<a,6<3 is chosen 

T<j<n 

uniformly from the unit sphere in M 9n . While this model introduces dependence among the 
coefficients, it is still possible to prove the almost sure convergence of the empirical spectral 
measure fi n of H n to the standard Gaussian distribution, albeit without a specific rate. 

Theorem 11. For each n > l, let fj, n be the spectral measure of the random matrix H n 
defined as in ©• Then almost surely, the sequence {/r n }neN tends weakly to the standard 
Gaussian distribution, as n —>• oo. 


It should be noted that the statement above implicitly assumes some joint distribution of 
the coefficient vectors, but the theorem is true independent of what that joint distribution 
is. 

That the earlier results for i.i.d. coefficients can be extended to this dependent setting 
relies on two important properties of uniform random vectors on the sphere. The first 
is that explicit computations are still at least somewhat feasible. The second is that the 
concentration of measure phenomenon which was strongly used in the Gaussian case holds 
for random vectors on the sphere as well, as follows. 


Lemma 12 (Levy’s lemma; see [Ch. 1 of [8]). Let (x*.)i <k<n be a random vector, uniformly 
distributed on S n_1 , and let F : M n -A M be Lipschitz with Lipschitz constant L. There are 
universal constants C, c such that 

P[|F(xi,... ,x n ) -EF(xi,... ,x n )\ >t]< Ce~ cnt2/L2 . 

The following modified version of Theorem Q] holds for the spherical model; here we 
compare p, n to p,® os rather than to the Gaussian distribution. 

Theorem 13. Let p, n be the spectral measure of H n and let := Ep, n be the density of 

states measure. There are universal constants C, C' and c such that 

(a) E d B L(Tn,Tn° S ) ' C 


< 


(b) P 


dBL(Tn,h 


DOS\ 
n ) 


n 1 / 5 ’ 
C 


> 


n 1 / 6 


+ t 


< Ce 


—cnt 2 . 


and 


(c) with probability 1, for all sufficiently large n, 

dBL^n, hn OS ) < 

All of the proofs in Section [3] go through in exactly the same way, using Levy’s lemma 
in place of Proposition [8l (Note the difference in normalization: Proposition [8] is stated 
for a standard Gaussian random vector, with expected length on the order of \Jn , whereas 
Levy’s lemma is stated for a random vector on the unit sphere.) The missing element in 
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showing the almost sure convergence of fi n to the Gaussian distribution is the comparison 
of lL n° S to Gaussian, which in the case of i.i.d. Gaussian coefficients in H n followed from 
the characteristic function estimate proved in J7|. The following result gives an analog of 
their result in for the spherical model, but without a similarly good rate of convergence; this 
is the reason that we do not obtain an almost sure convergence rate of p n in the spherical 
model. 


Proposition 14. Let if n (t) denote the characteristic function of the density of states mea¬ 
sure of H n as defined in Equation ([9]) . Then for each t € M, 

,2 

, /.\ n— »oo — L. 

■0 n(t ) -> e 2 . 

To modify the approach in [7] to prove Proposition 1141 we will need to calculate expecta¬ 
tions of certain functions over the unit sphere; the following lemma gives explicit formulae. 

Lemma 15 (See [1]). Let P(x) = |xi| ai |x 2|“ 2 • • • \x n \ an . Then if X is uniformly distributed 
on S n ~ l , 

r , T(/3i)---r(^ n )T(2) 

E[p(x)1 = 

where Pi = + 1) for 1 < i < n and 

POO POO 

T(t) = / s^e^ds = 2 / r 2t ~ 1 e~ r dr. 

Jo Jo 

The crucial technical ingredient for Proposition [14] is the following. 

Lemma 16. Let {x/cIkakiv be uniformly distributed on the unit sphere in M . N . For each 
t € M, 

' N 

E cos (tx k ) 

,k =l 

Proof. We hrst show that it suffices to approximate the cosine with a second-order Taylor 
expansion. It follows from the Levy’s lemma and the fact that | cos(tXfe)| < 1 that 


N—iOO _+2 

- > e 1 . 


N 


N 


E cos (tx k ) — E cos(tx fe )l 


k =1 


k =1 


1 (tefe ) 2 

<1 

= 

E 




L 


N 


( 10 ) 


< 


JcOs(tx fc )l r 
fc=l Ufc = 1 \ 

N f U- \2 

u 


1 - 


( tx k) 2 


>1 


Lfc=l 

' N 

U 

,k =1 


l ( tx k ) 2 

>41 

2 

Jj 


(tx k y 


> 2 


cN 

< CNe~~X. 


From the trivial estimate that \z\ - ■ ■ z m — w\ - ■ ■ w m \ < Y1T=1 \ Zk ~ Wk \ ^ \ z k\i l^fcl E 1 for 
all k, it then follows that 

( 11 ) 


N 


k =1 


E < cos (tx k ) — 1 + 


{tx k f 


I 1 

i ( tx k) 2 

<1 

VI 




k =1 


COS (tXk) — 1 + 


( tx k f 


1 - 


( tx k) 2 


<1 


< 


NE(t Xl ) 4 

4! 


< 


t 4 


N ■ 4!’ 
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where the last line follows from Lemma 1151 Now, by the Cauchy-Schwarz inequality, 


N 




fc=i 


N 


1-1 


<1 


Efl 1 


k =1 


{tx k f 


( 12 ) 


< 


1 


N 


Ejl 1 


fc = l 


(tx k y 


2 \ 2 


\ 


N 

U 

Lfc=l 


1 - 


itx k f 


> 1 


cN 

< Ce~~P 


N 


\ e n ( 1 _ ( tx k ) 2 + 

\ k= 1 


(tx k y 


Expanding this last expression and using Lemma [15] gives that 
(13) 

E n 

k =1 ' 


N 'N\ ^ (N - j\ {-t) 2 H u 


j =o 

N 


E(" E 

J ' e=o 






j =0 v 7 7 v 7 £=0 v 


41V 2 


IV 


E(p (4?Vb + 3(4 


N 


m 2 ) 


f_ , 3t 4 ^ 
N 




Since this last expression is asymptotic to e * , it is in particular bounded. Combining 
equations (flOl) . (ITT)) . (fT 2 l) . and (fT3l) gives that 

N N 


E jQ cos (tx k ) — E ]J[ ( 1 
k=l k =1 

N (i (tx k ) 2 


(14) 

and it remains to analyze E ["[/)= i - 9 

By Lemma fl5l 

^ _ ( tx k ) 2 

' k =1 


( tx k y 


< 


cr 

aT’ 


TV 

E [n C 1 - 


TV 


^ (N\f-t V r(f) 


fc =0 

JV l-t 2 \k 

E 


2 i ofcwiv 


2 fc r(f + fc) 

(^) fe IV(IV — 1 ) ■ ■ ■ (AT — /c + 1 ) 


/c =0 


fc! (JV + 2A: — 2)(JV + 2fc — 4) ■ ■ ■ (iV) 


E 

/c =0 


(=fr 


k\ 


ir : (1 -1) 

nti 1 (1 + #) 


rfc 1 (1 - 

ntiMi+l) 


< 1 , 


Clearly 
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and applying Taylor’s theorem to the logarithms gives that there is a constant C such that 


rfc 1 (i - j,) 

nt; (i+S 


/k -1 


= exp 


f k -1 


E log ( 1 - + log ( 1 + V 


I v- ,'3£ C£ 2 

V i=\ v 

3k(k - 1) Ck 3 


> exp — 


2N 


N 2 


It follows that for any m < N, 
N / x 2 


EJI 1 


fc = l 


(tx k y 


— e 2 


< 

- ^ k\ 

k =0 


1 — exp — 


3 k{k - 1) Ck 3 


2 N 


t£ /3m(m — 1) Cm 3 \ 


— e 2 I 


21V 


+ 


IV 2 

2 x m +1 


Ce 2 m 2 C 
<-+ 


JV 2 / + (m+1)! 

m +1 




IV ' y/rri \2(m + 1) 
Choosing, say, nn = ["iV 1 / 4 ] completes the proof. 


+ E 


oo (t±^k 


k=m +1 


As! 


□ 


Proof of Proposition \ lf\ The proof is a straightforward modification of the one in [7], mak¬ 
ing use of Lemma [16] instead of the corresponding computation for i.i.d. Gaussian coef¬ 
ficients; below are the details for the necessary modifications, with the part of the proof 
which is identical to that of [7] omitted. 

Suppose that n is even, and make the definitions 


A : = x °,bJ<Tj a) <r?+i 

B: =J2Y1 Xa fij<rj a) <rf+ 1 

j= 1 a,6=1 

J=1 a,6=1 

j even 

j odd 

n 

A X~^ ( a ) (&) 

+(6-l)+a := 2^ X a,b,j a j a j+ 1 

n 

d \ ^ (a) (6) 

5 3(fe _ 1)+a := 2^ o-}+i 

3 = 1 

J = 1 

j even 

j odd 


Then the terms within each sum of each of the A k and B k commute, and 

9 

H n = A + B = 2^(+ + B k ). 


Define 


(fn(t) := E 


k=i 

9 


k n 


g+Afc gitBk 


vfc=l 


Observe that since all of the terms within the A k and B k commute, 


g itA k _ g*tea,6,jcr) a) y+i 

3 = 1 
^'even 


e itB k _ g»tXo,6,3+ ) o- J -+i ) 

3 =1 
j odd 
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where k = 3 (b — 1) + a. Now, since the square of any of the Pauli matrices is the identity, 
it follows from the definition of the matrix exponential in terms of power series that 

itx a b ,'O^N a-^ b 2-i \ t i ' {-L \ ( a ) (b) 

e “*, 3 ] j+i = cos(te a!bj )/2" + sm(te ajbj )fjj VjYi- 

By the symmetry of the uniform distribution on S 9 "” 1 , any term with sine factors in the 
expansion of (f> n (t) has vanishing expectation, and so by Lemma IT(J1 


4> n (t) = E 


nn COs(fXfc) 

j =1 a,b= 1 


n—>-oo — L- 

-^ e 2 . 


At this point the proof can be completed essentially identically to the proof in [7j. □ 


From Proposition ITT1 it follows that Hn° S converges weakly to the standard Gaussian 
distribution; since the bounded-Lipschitz distance is a metric for weak convergence, this 
means that 


lim d B L^n° S , 7) = 0- 

n—>• oo 


It follows from part (c) of Theorem [T3] that 


lim dBL(Hn,Vn° S 

n—>• oo 


) = o 


almost surely, and so Theorem 1111 follows. 


5. Concluding remarks 


1. In [7] , the authors consider more general distributional assumptions on the coefficients 
{Za^j}. Specifically, they consider the random matrices 

3 n 

H n : = E 

a, 6=1 j=l 


(15) 


where the {a a ,b,j} are assumed to be independent and symmetric about 0, with 


3 n 


3 n 


E E Ea E,i = 1 E E E l^| 2+5 = 0 maxE|a aAj | 2 = o f E j , 


a,6=1 j =1 


a,6=1 j =1 


for some 5 > 0. The point is that these are the conditions on the under which 

the Lyapounov central limit holds, which allows the authors to show that the point- 
wise difference between the characteristic function of and that of the Gaussian 

distribution still tends to zero. In order to obtain the same rates of convergence as 
in the Gaussian case, slightly stronger assumptions on the rate of growth of moments 
are needed; however, without further assumptions, the concentration arguments used to 
move to almost sure convergence need not apply. 

A probability measure v is said to satisfy a quadratic transportation cost inequality 
with constant a if for all probability measures fi absolutely continuous with respect to 

(16) W 2 {^ 1 v) < \J aiL(/r||u), 

where W 2 is the L 2 -Kantorivich distance and H(fi\ |u) is the relative entropy (or Kullback- 
Leibler divergence) of n with respect to u. If instead of (11 5 . one assumes that the 
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distribution of each a a ,b,j satisfies a quadratic transportation cost inequality with the 
same constant a, then one can carry out the entire program used here in the Gaussian case 
with essentially no modification, and one again obtains the almost sure convergence of the 
spectral measure to Gaussian, with a rate of in IUi-distance. The assumption that 
the coefficients oc a ^j all share this property is the most general setting of independent 
a a,b,j in which the arguments using concentration of measure can be carried out (see [5] 
for a detailed discussion). 

2. In [3], Erdos and Schroder introduced a model for quantum spin glasses with arbi¬ 
trary coupling geometry. Given a sequence of undirected graphs T n on the vertex sets 
{1,..., n} they considered Hermitian random matrices defined by 


(x) = 


3^/e(T r r 


(ij)er n 


3 

a, 6=1 


(“) A&) 


cr 'a 


3 


where the a a ,b,(ij) are are assumed to be independent centered random variables with unit 
variance, and e(T n ) denotes the number of edges in T n . They proved weak convergence 
of the density of states measure for this model to the standard Gaussian distribution 
whenever the maximal degree of a vertex in T n is negligible in comparison to e(T n ) 

A tail bound similar to Theorem [fl part |(b)| can be obtained for this model if the coef¬ 
ficients are assumed to be standard Gaussian random variables. Using a proof identical 
to the proof of Lemma 9, one can show that for x = € M 9e (U) the map 

x ->• d B L {.Pm P) 

is — 1 = -Lipschitz for any probability measure p. It then follows from the concentration 

3y e(r n ) 

of measure for standard Gaussian random variables that 

n\d BL (Pn,p) -Ed BL (p n ,p)\ >t}< Ce~ ce ^ t2 . 


Applying this estimate in particular when p is the density of states measure, the 
almost sure convergence of p n to the standard Gaussian distribution follows from the 
Borel-Cantelli Lemma and the convergence proved in |3]. 

Erdos and Schroder also described a model for the Hamiltonian of a quantum p-spin 
glasses: 


jj(Pn-glass) _ 3 - Pn/2 ^ n ^ 


- 1/2 


E E 

l<il<...<i Pn <n ai,. 


a, 


ai, 




(“l) 


a 


( a Pn) 


,=1 


They found a sharp phase transition at the threshold p = s/n between the standard 
Gaussian distribution and the Wigner semicircle law, with an explicitly described limiting 
measure at criticality. 

Using the same arguments as above, it can be shown that for x = jTa,&.(*?) € M 9e ( rr *)}, 
the map 

x ->■ d BL {p n ,p) 

is 3 _P//2 (”) ^-Lipschitz for any probability measure p. Thus, if p n « y/n the empirical 
spectral measure converges almost surely to the standard Gaussian distribution and if 
p n » yfn the empirical spectral measure converges almost surely to the semicircle law, 
while if -kt n ^ > °°> A £ (0, 00 ), the empirical spectral measure converges almost surely to 
the limiting measure described in [3]. 
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